Using Intel Streaming SIMD Extensions for 3D Geometry Processing

Authors

  • Wan-Chun Ma
  • Chia-Lin Yang
Abstract

Three-dimensional (3D) graphics applications are an important workload on today’s computer systems. A cost-effective graphics solution is to use a general-purpose processor for 3D geometry processing and specialized hardware for rasterization. 3D geometry processing is an inherently parallel task, so many CPU vendors have added SIMD (Single Instruction, Multiple Data) instruction extensions to accelerate it. In this paper, we evaluate the performance impact of using the Intel Streaming SIMD Extensions (SSE) for 3D geometry processing. We use SIMD-FP to improve computational throughput by processing four vertices in parallel. We find that the layout of vertices in memory is important for the effectiveness of SIMD-FP. We also study the effect of using prefetch instructions to improve memory performance. The experimental results show that using Intel SSE can achieve close to a 4x speedup for geometry processing.
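The idea of processing four vertices in parallel can be illustrated with a minimal sketch. It is not the authors' code: the `VertexSoA` type, the `transform_soa_sse` function, the row-major matrix convention, and the prefetch distance of 16 floats are all assumptions made for illustration. The sketch assumes an x86 compiler with SSE support and a structure-of-arrays layout, in which each coordinate is stored contiguously so that one 128-bit load fetches the same component of four consecutive vertices.

```c
#include <assert.h>
#include <stddef.h>
#include <xmmintrin.h> /* SSE intrinsics (x86 only) */

/* Hypothetical structure-of-arrays vertex buffer: x, y and z live in
 * separate contiguous arrays, so four vertices fill one SSE register. */
typedef struct {
    float *x, *y, *z;
    size_t count; /* assumed to be a multiple of 4 */
} VertexSoA;

/* Computes row . (x, y, z, 1) for four vertices at once. */
static __m128 dot_row(const float *row, __m128 x, __m128 y, __m128 z)
{
    __m128 acc = _mm_mul_ps(_mm_set1_ps(row[0]), x);
    acc = _mm_add_ps(acc, _mm_mul_ps(_mm_set1_ps(row[1]), y));
    acc = _mm_add_ps(acc, _mm_mul_ps(_mm_set1_ps(row[2]), z));
    return _mm_add_ps(acc, _mm_set1_ps(row[3]));
}

/* Transforms points (w = 1) by the first three rows of a row-major
 * 4x4 matrix m, handling four vertices per loop iteration. */
void transform_soa_sse(const float m[16], const VertexSoA *in,
                       const VertexSoA *out)
{
    assert(in->count % 4 == 0 && out->count == in->count);

    for (size_t i = 0; i < in->count; i += 4) {
        /* Hint the cache about data needed a few iterations ahead.
         * The distance (16 floats) is an illustrative guess. */
        if (i + 16 < in->count) {
            _mm_prefetch((const char *)(in->x + i + 16), _MM_HINT_T0);
            _mm_prefetch((const char *)(in->y + i + 16), _MM_HINT_T0);
            _mm_prefetch((const char *)(in->z + i + 16), _MM_HINT_T0);
        }

        /* Unaligned loads keep the sketch free of alignment requirements. */
        __m128 x = _mm_loadu_ps(in->x + i);
        __m128 y = _mm_loadu_ps(in->y + i);
        __m128 z = _mm_loadu_ps(in->z + i);

        _mm_storeu_ps(out->x + i, dot_row(m + 0, x, y, z));
        _mm_storeu_ps(out->y + i, dot_row(m + 4, x, y, z));
        _mm_storeu_ps(out->z + i, dot_row(m + 8, x, y, z));
    }
}
```

With an array-of-structures layout (x, y, z interleaved per vertex), the same loop would first need shuffles to gather each component into its own register, which is one plausible reason the memory layout matters for SIMD efficiency.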

Related resources

Optimized CPU-based Skinning for 3D Games

Introduction: Lifelike 3D character models play an increasingly important role in many computer games. Organic models, such as people, are more complex to render than rigid bodies because the mesh that defines the shape of the model constantly changes as the model animates. This animating mesh is referred to as a ‘skin’ since it’s influenced by the underlying structure of the object; ‘skinning’ ...

Real-time full-programmable fuzzy processor on an Intel Pentium III

This paper presents a real-time full-programmable fuzzy processor using piecewise-linear interpolation techniques and implements it using the SSE (Streaming SIMD Extensions) set of instructions of an Intel Pentium III. A full-programmable fuzzy processor is defined as a system where the set of rules, the membership functions, the t-norm, the t-conorm, the aggregation operator, the propagation oper...

An Implementation of Parallel 1-D FFT Using SSE3 Instructions on Dual-Core Processors

In the present paper, an implementation of a parallel one-dimensional fast Fourier transform (FFT) using Streaming SIMD Extensions 3 (SSE3) instructions on dual-core processors is proposed. Combination of vectorization and the block six-step FFT algorithm is shown to effectively improve performance. The performance results for one-dimensional FFTs on dual-core Intel Xeon processors are reported...

The SSE Provides a Rich Set of Instructions to Meet the Requirements of Demanding

In the volume PC market, the demand is growing for general-purpose processors that facilitate visual and graphical computing on the Internet. Responding to this need, Intel developed the streaming SIMD extensions (SSE), a set of processor instructions designed to boost performance of multimedia and Internet applications, and implemented them on its Pentium III pr...

Fast Trigonometric Functions Using Intel’s SSE2 Instructions

The goal of this work was to answer one simple question: given that the trigonometric functions take hundreds of clock cycles to execute on a Pentium IV, can they be computed faster, especially given that all Intel processors now have fast floating-point hardware? The streaming SIMD extensions (SSE/SSE2) in every Pentium III and IV provide both scalar and vector modes of computation, so it has ...

Publication date: 2002